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ABSTRACT. In this paper, we re-visit the combinatorial error model of Mazumdar et al. [3] that 
models errors in high-density magnetic recording caused by lack of knowledge of grain boundaries 
•_ ■ in the recording medium. We present new upper bounds on the cardinality /rate of binary block codes 

that correct errors within this model. 
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1. Introduction 



The combinatorial error model studied by Mazumdar et al. ||3] is a highly simplified model of an 
error mechanism encountered in a magnetic recording medium at terabit-per-square-inch storage 
densites H, Q. In this model, a one-dimensional track on a magnetic recording medium is divided 
into evenly spaced bit cells, each of which can store one bit of data. Bits are written sequentially 
into these bit cells. The sequence of bit cells has an underlying "grain" distribution, which may be 
described as follows: bit cells are grouped into non-overlapping blocks called grains, which may 
consist of up to b adjacent bit cells. We focus on the case b = 2, so that a grain can contain at most 
two bit cells. We define the length of a grain to be the number of bit cells it contains. 

Each grain can store only one bit of information, i.e., all the bit cells within a grain carry the 
same bit value (0 or 1), which we call the polarity of the grain. We assume, following 0, that in 
the sequential write process, the first bit to be written into a grain sets the polarity of the grain, so 
^ ■ that all the bit cells within this grain must retain this polarityQ This implies that any subsequent 

attempts at writing bits within this grain make no difference to the value actually stored in the bit 
cells in the grain. If the grain boundaries were known to the write head (encoder) and the read 
head (decoder), then the maximum storage capacity of one bit per grain can be achieved. However, 
in a more realistic scenario where the underlying grain distribution is fixed but unknown, the lack 
of knowledge of grain boundaries reduces the storage capacity. Constructions and rate/cardinality 
bounds for codes that correct errors caused by a fixed but unknown underlying grain distribution 
have been studied in the prior literature [0, 10. In this paper, we present improved rate/cardinality 
upper bounds for such codes. 

The paper is organized as follows. After providing the necessary definitions and notation in 
Section [2 we derive, in Section an upper bound on the cardinality of t-grain-correcting codes, 
for t = 1, 2, 3, using the fractional covering technique from [ Q. We conjecture that the upper bound 
in fact holds for all t. We further conjecture that the same technique should yield a stronger upper 
bound, and we report some progress towards this in Section @] An information-theoretic upper 
bound on the maximum rate asymptotically achievable by codes correcting a constant fraction of 
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grain errors is derived in Section [5] We conclude in Section [6] with some remarks concerning the 
bounds. Some of the technical proofs from Sections [3] are given in an appendix. 

2. Definitions and Notation 

Let £ = {0, 1}, and for a positive integer n, let [n] denote the set {1,2,..., n}. A track on 
the recording medium consists of n bit cells indexed by the integers in [n]. The bit cells on the 
track are grouped into non-overlapping grains of length at most two. A length-2 grain consists of 
bit cells with indices j — 1 and j, for some j G [n]; we denote such a grain by the pair (j — 1, j). 
Let E C {2, . . . , n} be the set of all indices j such that (J — 1, j) is a length-2 grain. Since grains 
cannot overlap, i£ contains no pair of consecutive integers. 

A binary sequence x = (x\, . . . , x n ) G S n to be written on to the track can be affected by 
errors only at the indices j G E. Indeed, what actually gets recorded on the track is the sequence 
y = (yi,---,Vn), where 

= {xj-i tfjeE 
^ [ Xj otherwise. 

For example, if x = (000101011100010) and E = {2,4, 7,9, 14}, then y = (000001111100000). 
Note that the set E completely specifies the positions and locations of all the grains (both length- 1 
and length-2) in the track. We will call this set the grain pattern. It is assumed that the grain pattern 
is unknown to both the write head and the read head. The effect of the grain pattern E on a binary 
sequence x G X n defines an operator 4>e '■ S n — > S n , where y = 4>e(^-) is as specified by £0) 
above. 

For integers n > 1 and t > 0, let £ n :t denote the set of all grain patterns E with \E\ < t. In other 
words, £ n: t consists of all subsets E C {2, . . . , n} of cardinality at most t, such that E contains no 
pair of consecutive integers. For an x G S n , we define 

$ t (x) = {^(x) : E G £ n , t }. 

Thus, Q t (x) is the set of all possible sequences that can be obtained from x by the action of some 
grain pattern E with \E\ < t. Two sequences xi and X2 are t-confusable if ^(xi) n < I ) i(x2) ^ 0. 
A binary code C of length n is said to correct t grain errors, or be a t-grain-correcting code, if 
no two distinct vectors xi, X2 G C are f-confusable. Let M(n, t) denote the maximum cardinality 
of a t-grain-correcting code of length n. Also, for r G [0, |], the maximum asymptotic rate of a 
\rn\ -grain-correcting code is defined to be 

R(t) = limsup — log 2 M(n, |~rra]). (2) 
n— >oo Tl 

A grain pattern E changes a sequence x to a different sequence y iff Xj-i ^ Xj for some j G E, 
i.e., the length-2 grain (j — 1, j) straddles the boundary between two successive runs in x. Here, a 
run is a maximal substring of consecutive identical symbols (0s or Is) in x. A run consisting of 0s 
(resp. Is) is called a 0-run (resp. l-run). The number of distinct runs in x is denoted by r(x). 

A convenient means of keeping track of run boundaries in x is via its derivative sequence, x': 
for x = (xi, . . . , x n ), the sequence x' = (x' 2 , • • • , %' n ) i s defined by x'j = Xj-\ © Xj, j = 2, . . . , n, 
where © denotes modulo-2 addition. The Is in x' identify the boundaries between successive runs 
in x. Thus, w(x') = r(x) — 1, where u(-) denotes the Hamming weight of a binary sequence. 

Let supp(x') = {j : x'j = 1} denote the support of x'. For x G S n , the sequences y G 3>t(x) 
are in one-to-one correspondence with the different ways of selecting at most t non-consecutive 
integers^ from supp(x') to form a grain pattern E G £ n ,t- Thus, | < l ) t(x)| counts the number of ways 



'A sequence or set of non-consecutive integers is one that does not contain a pair of consecutive integers. 
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of forming such grain patterns. This count can be obtained as follows. Let £±,£2, ■ 
lengths of the distinct 1-runs in x', and define the set 



denote the 



(3) 



T={(t 1 ,...,t m )GZ™:^t j <t}, 

3=1 

where Z + denotes the set of non-negative integers. In the above expression, tj represents the 
number of integers from the support of the jth 1-run that are to be included in a grain pattern E 
being formed. The number of distinct ways in which tj non-consecutive integers can be chosen 
from the tj consecutive integers forming the support of the jth 1-run is, by an elementary counting 
argument, equal to ( £j Thus, 



!**(*) 



{h,...,tm)eTj=l v 
Simplified expressions can be obtained for small values of t 



(4) 



Proposition 1. For x £ E n , let uj = w(x') denote the Hamming weight of the derivative sequence 
x'. Also, let m be the number of 1-runs in x'. 

(a) |*i (x) I = 1+uj = r(x). 

(b) |* 2 (x)| =l+m+{%). 

(c) |*3 (x) I = 1 + mi + m(w - 3) + (£) - (£) + 2u, where m± denotes the number of 1-runs 
of length 1 in x'. 

Proof, (a) While the expression for |*i(x)| can be directly obtained from ((U, it is simpler to ob- 
serve that the set *i(x) consists of the sequence x itself, and the uj distinct sequences in the set 
{<^>e(x) : E = {j} for some j G supp(x')}. 

(b) For t = 2, it is easy to see that the expression in dU simplifies to 

m m /p. _ i\ 

Wx)hl + & + E 3 2 )+ E 

7=1 7 = 1 ^ ' (i,i):i<i 



We then have 



|* 2 (x) 



l + m + E(^-l) + E 

3=1 3=1 

l + m + E(M+ E 

3=1 ' J (i,j):i< 
_. m 

l + m+-E(^-^)+ E 



{i,3)'-i<3 
1 



+ E 44 

{i,j)-i<3 



£i£j 



w 3 



(i,j)--i<3 



1 + 171 + - 



1 + m + 



E^ 2 + E ^-E^ 



j=l 

■3) -£4 



1 + m + 



UJ 



the last equality being due to the fact that uj = Y^j=i c j- 
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(c) For t = 3, the expression in (O can be written as 

i* 3 (*)i=i* a (x)i+ e w*+e(V)(e^)+e(V)- 

(i,j,k):i<j<k i=l ^ ' i=1 

From here on, straightforward algebraic manipulations lead to the expression given in the statement 
of the proposition. We omit the details, noting only that m\ enters the picture when we write the 
last term above as 

^ (4-2X4-3X4-4) ] £ L 

1=1 ^ i:L = l 



The extra term XW=l 1> which equals mi, accounts for the fact that the expansion of ( * 3 ) as 
(ii -2) -3) (d -4) is invalid when 4 = 1; by convention, (£) = when a < 0. □ 

3. An Upper Bound on M(n, i) 

In this section, we explore the applicability to grain-correcting codes of a technique used by 
Kulkarni and Kiyavash 1 1] to derive upper bounds on the cardinalities of deletion-correcting codes. 

A hypergraph % is a pair (V, X), where V is a finite set, called the vertex set, and X is a family 
of subsets of V. The members of X are called hyperedges. A matching of % is a pairwise disjoint 
collection of hyperedges. A (vertex) covering of % is a subset T C V such that T meets every 
hyperedge of Ti, i.e., T n X ^ for all X G Af. The matching number v(%) is the largest size of 
a matching of Ti, while the covering number, t(H), is the smallest size of a covering of %. 

The problems of computing the matching and covering numbers can be expressed as a dual 
pair of integer programs. This is done via the vertex-hyperedge incidence matrix, A, of %, which 
is defined as follows. Let vi,V2, ■ ■ ■ , v\y\ and X\,X2, . . . , X\x\ be a listing of the vertices and 
hyperedges, respectively, of H. Then, A = (Aij) is the |V| x \X\ matrix with 0/1 entries, with 
Aij = 1 iff Vi G Xj. It is easy to verify that 

v(H) = max{l T z : z G {0, l} 1 * 1 , Az < 1} (5) 

and 

t(H) = min{l T w : w G {0, l} lVl , A T w > 1}, (6) 
where 1 denotes an all-ones column vector. Note that the linear programming (LP) relaxations of 
©, 

u f (H) = max{l r z : z > 0, Az < 1}, (7) 

and ©, 

Tf (U) = min{l T w : w > 0, A T w > 1}, (8) 
are duals of each other. By strong LP duality, we have Vf(H) = Tf{%), and hence, 

v{H) < vf{H) = T f (H) < t(H). (9) 

The quantities Vf(7i) and Ty('H) are called the fractional matching number and fractional cov- 
ering number, respectively, of the hypergraph H. Any non-negative vector w such that A T -w > 1 
is called a fractional covering of T~L. To put it in another way, a fractional covering is a function 
w : V — > M+ such that J2 v ex w { v ) — 1 f° r an X £ X. The value of a fractional covering w is de- 
fined to be |w| := J2 v eV w l v )- From the inequality v(H) < Tf{H) in (O, we see that v(W) < |w| 
for any fractional covering w of %. We use this to suggest an upper bound on the largest size, 
M(n, t), of a t-grain-correcting code of blocklengfh n. 

Consider the hypergraph H n ,t = (V, X), where V = S n , and X = {$ t (x) : x G S n }. Note that 
j/(W n) t) = M(n,t); thus, fractional coverings of T-l n ^ yield upper bounds on M(n,t). Bounding 



A fractional matching is correspondingly defined, but we will have no further use for this concept. 
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the size of packings in this way has been extensively used in combinatorics, see e.g. 0. Taking 
inspiration from fl], we consider the function w t : T> n — > R + , defined for x G S n as 

w ' (x) =*W (10) 

For t = 1,2, 3, we can prove that w t is a fractional covering of 1~L n j, and conjecture that this is in 
fact the case for all i > 1. 



Conjecture 3.1. For all positive integers n and t, the function w< defined in ( 1701 ) is a fractional 
covering ofH n> t, i.e., for all x G S n , 

ye<l>t(x) 



Therefore, 



M(n,i)<K|=£--j_. (12) 



Our proof of ([TTT > for t = 1, 2, 3 relies on an understanding of the relationship between |<3?t(x)| 
and |3>i(y)| for y £ *t(x). Recall, from (0]), that |$t(x)| depends only on the lengths of the 1-runs 
in x'. Thus, we need to understand how the distribution of Is changes in going from x' to y'. 

3.1. Effect of Grains on the Derivative Sequence. Recall that Is in x' correspond to run bound- 
aries in x. We say that a (length-2) grain acts on a 1 in x' if it straddles the corresponding run 
boundary in x. We need to distinguish between two types of Is in the derivative sequence x'. A 
trailing 1 is the last 1 in a 1-run, while a non-trailing 1 is any 1 that is not a trailing 1. Grains act 
on trailing Is in a manner different from non-trailing Is. 

A segment of x' that contains a trailing 1 is of the form *10*, or *1 in case the trailing 1 is a 
suffix of x. Up to complementation, the corresponding segment of x is of the form *011* or *01. 
A grain acting on the trailing 1 in x' straddles the 01 run boundary in x. In the sequence y obtained 
through the action of this grain, the segment under observation becomes *001* or *00, and the 
corresponding segment of the derivative sequence y' is *01* or *0. 

On the other hand, a non-trailing 1 in x' belongs to a segment of the form *11*; the first 1 shown 
is the non-trailing 1 under consideration. Again, up to complementation, the corresponding segment 
in x is of the form *010*. A grain acting on the non-trailing 1 in x' straddles the 01 run boundary 
shown in x. This grain causes the segment being observed to become *000* in y, and hence *00* 
in y'. 

To summarize, the action of a grain on a trailing 1 converts a segment of the form *10* or *1 in 
x' to *01* or *0 in y', and a grain acting on a non-trailing 1 converts a segment of the form *11* 
in x' to *00* in y'. It should be clear that the bits depicted by *s on either side of these segments 
remain unchanged by the action of the grain. Note, in particular, that a grain acting on a 1 in x' does 
not increase the Hamming weight of x'. A grain acting on a trailing 1 either leaves the Hamming 
weight of x' unchanged, or reduces it by 1; in the case of a non-trailing 1, the Hamming weight of 
x' is always reduced by 2. 

Finally, when dealing with a grain pattern containing t > 1 length-2 grains, since the grains are 
non-overlapping, the actions of individual grains can be considered independently. 

3.2. Proof of (SB fort = 1,2,3. Consider t = 1 first. For any y G $i(x), we have w(y') < w(x') 
by the discussion in Section [37X1 and hence, |$i(y)| < | < I > i(x)| by Proposition Q] Therefore, 



E 



1 1 



|3>i(y)l ' l^i fx 

ye<fi(x) 1 iVJVI ye*i(x) 1 u 



which proves (fTT]) for t = 1. 
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The simple argument above does not extend directly to t > 2, the reason being that it is no 
longer true in general that I'&tfy)! < | <J>t (x) | for y G ^t(x). For example, consider x = 0100, 
and note that $ 2 (x) = {0000,0100,0110}. Take y = 0110 G $ 2 (x), and verify that $ 2 (y) = 
{0110,0010,0111,0011}. Thus, |$ 2 (y)| > |*2(x)|. 

To prove (fTTT t for t = 2,3, we show that the sequences y G ^t(x) that violate the inequality 
^(y)! < l*t( x )l can be dealt with by suitably matching them with sequences that satisfy the 
inequality. To this end, for a fixed x G £™, let us define F t (x) = {y G ^(x) : |3><(y)| > |$« (x)|} 
and Gt(x) = {y G ^tfx) : |<3?i(y)| < |$t(x)|}. We will construct a one-to-one mapping p : 
Fj(x) — > G((x) such that for all y G i*i(x), we have 

> +1 1 2 » (13 ) 



l*i(y)l l*i(p(y))l " l*t(* 

The mapping p will be referred to as a pairing. Let -Pt(x) = p(i*i(x)) denote the image of p, and 
let Q t (x) = G t (x) \ P t (x). Thus, |P t (x)| = |F t (x)|, and $ t (x) = F t (x) U P t (x) U Q t (x). Then, 



I ^ I i 



^ |$*(y)| ^ |*t(y)| + ^ l^(y)l + ^ KMy) 



l*t(y)l ^ l*t(p(y))l ^ l*t(y) 

y£-Pt(x) UJVI yGF t (x) 1 yeQ t (x) 1 UJ 7 

V 2 , ^ 1 



,$t(x)| ^ |$ t (x) 
yeFt(x) 1 twl yGQ t (x) 1 tv ; 



1 



(2|P t (x)| + |Q t (x)|). 



I*t(x) 

The last expression above is equal to 1 since 2|F t (x)| + |Q t (x)| = |F(x)| + |i"t(x)| + |Qt(x)| = 
|<J>t(x)|. Thus, the construction of a pairing satisfying (H~3T > is sufficient to prove dTTT >. and hence, 
(fT2l ). Such a pairing can indeed be constructed for t = 2, 3, and we give a proof of this in Appen- 
dix A. 

In summary, we have obtained the following result. 
Theorem 2. For any positive integer n and t = 1,2,3, we have 

M(n,t)< Y, |^)| • 



For t = 1, an exact closed-form expression can be derived for ^ x i^^yy- Indeed, 

E l W \ -» 1 _ \ ^ \ £ 

1*1 fx) I ~ ^ |r(x)| ~ 2^ r 

(J) oJn-lU (j Al/n 

\ r — 1 / r n 

r=l x ' r=l 

2,_ 1 



-(2 n -l) = -(2 n+i -2). 
n n 

Equality (a) above is by virtue of Proposition [TJ (b) is due to the fact that the number of x G S n 
with r(x) = r is equal to twice the number of x' G S"" 1 with cj(x') = r — 1; and (c) uses the 
identity = £(")• Thus, we have 

Corollary 3. M(n, 1) < ^(2 n+1 — 2) for all positive integers n. 
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For t = 2,3, analogous closed-form expressions for the upper bound in Theorem|2]do not appear 
to exist. However, using Proposition [Q the bounds can be expressed in a form more convenient for 
numerical evaluation. 



Corollary 4. With the convention that (_M equals 1 if a 
following bounds hold: 

n— 1 min{cj,n— uj} 

(a) M(n,2) < 2 • j] £ 

u=0 m=0 

n -l mm{w,n-w} m . 

(b) M(„,3)<2.£ 2 W™ 

w=0 m=0 mi=0 V 

^(mi, m, cj) = 1 + mi + m(w — 3) + 



-1, arcc/ equals otherwise, the 



uj - 
m 



n — uj 
m 

uj - 
m — 



1 



1 



m 

mi — 1 

' + 2w. 



n — uj\ 1 
m / ^s(mi,m, uj) 



where 



3J \2j 

Proof. The expressions for the upper bounds are simply alternative ways of expressing ^ x pgrj^ji 

using Proposition [T] The factor 2 in the bounds arises from the fact that each x' G S n_1 is the deriv- 
ative of exactly two distinct sequences x G S". In the bound for M(n, 2), the term 



\m—l) 



is 



the number of sequences x' G S n 1 with Hamming weight cj and exactly m 1-runs. Analogously, 
in the bound for M(n, 3), the term ( ™ ) (^1™^) ( n ~ w ) is the number of sequences x' G S™" 1 
with Hamming weight uj and exactly m 1-runs, of which exactly m\ runs are of length 1. □ 



n 

t 


2 


3 


4 


5 


6 


7 


8 


9 


10 


15 


20 


1 


3(2) 


4(4) 


7(6) 


12(8) 


21 (16) 


36 (26) 


63 (44) 


113 


204 


4368 


104857 


2 






7(4) 


11(8) 


17 (10) 


27 (16) 


43 (22) 


70 


114 


1552 


26418 


3 










17 (8) 


26(16) 


41 (18) 


65 (32) 


101 


1024 


12510 



Table 1 . Some numerical values of the upper bound of Theorem[2] rounded down 
to the nearest integer. Within parentheses are the corresponding lower bounds from 
Table I of 0. 



Table [TJ lists the numerical values of the bounds in Corollaries [3] and |4] for some small values 
of n. Two other upper bounds on M(n, t) exist in the prior literature, namely Corollary 6 of ||3] 
and Theorem 3.1 of Q. Numerical computations for n < 20 show that our bounds above are 
consistently better than the bounds obtained from [5, Theorem 3.1]. On the other hand, the bound 
of (3] Corollary 6] may be better than our bound for small values of n: for example, the bound in 
j3l yields M(10, 2) < 92. However, our bound is better for all n sufficiently large: for t = 1, our 
bound is better for all n > 8; for t = 2, our bound wins for n > 13. 

3.3. Some Remarks on the Proof for Arbitrary t. We outline here one possible approach to 
proving Conjecture 13. ll for general t. To prove (flTT ). it is enough to show that for each x G S n , 

£ Q**(y)| - |**(x)|] < 0. 

ye$t(x) 

Indeed, the above inequality is equivalent to showing that the arithmetic mean pgrj^i S ye $ t ( x ) l^t(y) 
is at most |$ t (x)|. If this is true, then by concavity of the function f(x) = we would have 
1 y> 1 1 1 

HM^ Ji, 1*551 " ^ £ \My)\ ^5)1 

ye*t(x) 

which is the desired inequality (TTTT t. The arguments given in Appendix A for t = 2, 3 essentially 
follow this approach. 
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4. A Stronger Conjecture 

We in fact conjecture that a bound tighter than that of Conjecture 13. II may hold. To state this 
bound, let us define V(n,t) to be the cardinality of a Hamming ball of radius t in E n : 



V(n,t) 



t 

E 

3=0 



Note that for any x G S n , we have |$t(x)| < V(uj(x!),t), where w(x') is the Hamming weight of 
the derivative sequence x'. This is because V(uj(x.'),t) counts the number of ways that a pattern of 
up to t length-2 grains could affect x if the grains were not constrained to be non-overlapping. 
We conjecture that the function w t : S n — > M.+, defined by 

*< (x) = vd^) 

is a fractional covering of the hypergraph H n> t. Note that 

^ n— 1 

xGS 



^ n V{u>{x!),t) 



uj=0 



n — 1 

id 



v( u ,ty 



since 2( n ^ 1 ) is the number of sequences x G E n whose derivative sequence x' has Hamming 
weight u). 



Conjecture 4.1. For all positive integers n and t, and for all x G S n , we have 

1 



E 

ye*t(x) 



^(w(y'),t) 



> l. 



Therefore, 



M(n, t) <2Y 1 ( n - 1 ) 

LU = V 7 



(14) 



(15) 



Note that (Q3]> is tighter than ([12]), since |$t(x)| < V(w(x'), t). For t = 1, the two bounds are 
identical by virtue of Proposition (Ha); hence, in this case, Theorem [2] shows that the conjecture is 
true. We can also prove that the conjecture holds for t = 2, 3. 

Theorem 5. For any positive integer n and t = 1,2,3, we have 



M(„,t)<2g("- 1 ) 



1 



it - V 
u; ) V(u, t) 



Table |2] lists the numerical values of the bound in the above theorem for some small values of n. 
Again, for the sake of comparison, the corresponding lower bounds from Table I of are given in 
parentheses. We do not tabulate the row for t = 1 as this is the same as that in Table [TJ 



n 

t 


4 


5 


6 


7 


8 


9 


10 


15 


20 


2 


7(4) 


10 (8) 


15 (10) 


24 (16) 


39 (22) 


62 


102 


1406 


24306 


3 






15(8) 


23 (16) 


34(18) 


53 (32) 


81 


800 


9921 



Table 2. Some numerical values of the upper bound on M(n, t) of Theorem [5 
rounded down to the nearest integer. 



In the remainder of this section, we give a proof of Theorem [5] 
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4.1. Proof for t = 2. Fix x G £ n , and let a; = w(x'). We want to prove ([141 for t = 2. From 
the discussion in Section l3Tl we know that for any y G ^(x), the Hamming weight of y' must lie 
between w — 4 and w. For j = 0, 1, 2, 3, 4, let A, be the number of sequences y 6 $2(x) sucn that 
u(y r ) = oj-j. 

Lemma 6. Le? to denote the number of 1-runs in x', and let mi be the number of these that are of 
length 1. Then, 

(a) Aq + A x = l + m+ (™); 

(b) A 2 + A3 = (oj — m)(m + 1) — (to — mi); 

(c) A 4 = ( w - m ) - (oj - m) + (m - mi). 

Proof. Let y = 0s(x) for some E G £ nj2 . Write x' = (x' 2 , • • • ,x' n ). Note that x' contains m 
trailing Is and oj — to non- trailing Is. 

(a) We have oj(y') = oj or oj — 1 iff each j £ E acts upon a trailing 1 of x'. Let J = {j G 
{2, . . . , n} : is a trailing 1} be the positions of the trailing Is in x'. Thus, \J\ = m, and J does 
not contain consecutive integers. The sequence y is counted by Aq + A\ iff E C J. The number 
of such grain patterns E is precisely 1 + m + (™) . 

(b) We have oj(y') = oj — 2 or oj — 3 iff exactly one j G E acts upon a non-trailing 1 in x'. 
Thus, for a grain pattern E G £ n> 2 to contribute to A 2 + ^3, exactly one grain in the pattern must 
act on a non-trailing 1. The number of such grain patterns E with \E\ = 1 is precisely oj — m. It 
remains to count the number of grain patterns E of cardinality 2 that contribute to A 2 + A3. Let 

= {i, j}, where i and j are the grains acting on a trailing 1 and a non- trailing 1, respectively. If 
i acts on an "isolated" 1, i.e., a 1-run of length 1, then j can act on any of the oj — m non-trailing 
Is. On the other hand, if i acts on a trailing 1 from a 1-run of length at least 2, then j can be any of 
the non-trailing Is except for the 1 at position i — 1. It follows that the number of grain patterns of 
cardinality 2 contributing to A2 + A3 equals mi(w — m) + (to — m\)[oj — to — 1). Thus, 

A2 + A3 = {oj — to) + m\(oj — to) + (to — m\){oj — to — 1) 
= (oj — m)(m + 1) — (to — toi). 

(c) This part follows from the fact that A4 = | $2 (x) | — j=o A? > usm § the expression for 
I <3?2 (x) I given in Proposition [lib). □ 

We are now ready to prove (fl4l . For convenience, we use V(a) to denote 1 + a + . We start 
with 

x - 1 A) + ^i A 2 + A 3 Ai 



7(^,2) " V» F(o;-2) V>-4) 

\{oj-m){oj + m + l) A 2 + A z A 4 

V(oj) V(oj - 2) V(oj - 4) ' 1 ^ 

The equality above simply uses the fact that V(oj) — V(m) = ~(uj — m){oj + to + 1). Now, note 
that 

A 2 + A 3 A 4 (oj-m)(m + l) ("" 2 m ) - (oj - to) 

y(w -2) F(w - 4) - V(w - 2) V(w - 4) 

(w — to)(to + 1) \(oj — m)(oj — m — 3) 
y(w - 2) + V(oj - 4) ' 
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Therefore, carrying on from (|T6l , we have 
1 



ye*2(x) 



> 1 + (uj — m) 



> 1 + (uj — m) 



m + 1 o 

+ 2 



\{oj — m — 3) i(w + m,+ l) 



l + -(w-m) 



V(w-2) ' V(w-4) 

m + 1 + \(uj — m — 3) |(o; + m + 1) 
V(w - 2) 

uj + m — 1 w + m + 1 



(17) 



V(u) 



V(uj - 2) 



V(uj) 



1 + 



| (a; — m) 



[w 2 + 2mw - (m + 3)]. 



(18) 



If w 



V(uj - 2)V(uj) 

m, then (fT71) proves (fl4l) . Else, if w > m + 1, then the term within square brackets in 



181) can be further bounded as follows: 



UJ 



2mui 



(m + 3) > (m + l) 2 + 2m(ra + 1) - (m + 3) 
= 3m 2 + 3m — 2, 



which is positive for m > 1. Thus, again, we have (fT4b . which completes the proof of the t = 2 
case. 

4.2. Proof for £ = 3. The approach is the same as that for t = 2, but the computations are more 
cumbersome. So, let x G S n be fixed, and let uj = w(x'). The Hamming weight of y', for any 
y € ^(x), lies between u — 6 and uj. So for j = 0,1, ... ,6, let Bj be the number of y € ^(x) 
such that u(y') = u — j. 

Lemma 7. Let m denote the number of 1-runs in x', and let rrii, i = 1,2, be the number of these 
that are of length i. Then, 

(a) B {j + B 1 = l+m+{™) + {™); 

(b) B 2 + B 3 = (uj - m)(l + m + (™)) - m(m - mi); 

(c) S 4 + = (1 + m) [( u 2™) - (w - m)] - (w - 2m - 3)(m - mi) - m 2 ; 

(d) Bq = ("j™) — (uj — m)(uj — m + 1) + (uj — m)(m — mi) + 4(w — 2m + mi) + m% 



Proof Let y = $e(x) for some E 1 G <S nj 3. 

(a) This is proved by an easy extension of the proof of Lemmata). 

(b) For a grain pattern E G S n> 2 to contribute to B 2 + B%, exactly one grain in the pattern 
must act on a non-trailing 1. The number of such grain patterns E with \E\ < 2 is equal to 
(uj — m) (m + 1) — (m — m\ ) by LemmaOb). Extending the arguments in the proof of LemmalJb), 
we determine that the number of grain patterns of cardinality 3 that contribute to B 2 + B% is equal 

to ( m 2)(uJ -m) + m l (m- mi) (a; - m - 1) + ( m " 2 mi )(a; - m - 2). Thus, 



Bo + -B3 



(uj — m)(m + 1) — (m — mi) + 



mi 



(uj — m) 



. m — mi . 

+ m\(m — m\)(uj — m — 1) + \(uj — m — 2), 



which simplifies to (uj — m) (l + m + \T)) — m(m — mi). 

(c) This part follows from the fact that B4 + B5 = | $3 (x) | — Ylj=o — Bq, using the expression 
for |$3(x)| given in Proposition (He). 

(d) Bq equals the number of grain patterns E G £ n $ with \E\ =3, in which all three grains act 
on non-trailing Is of x'. The sequence x' has m — m\ 1-runs of length at least 2; let l\, ... , £ m - mi 
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1 



m—mi 



+ E 



denote the lengths of these runs. Then, for i = 1, . . . , m — mi, l i = I j — 1 denotes the number of 
non-trailing Is in these runs. With this, we can write 

B « = E c« + E 

(i,j,k):i<j<k i=l 

From this, straightforward algebraic manipulations yield the expression in the statement of the 
lemma. The algebra here is analogous to that needed to prove Proposition [TJc). □ 

For convenience, we define U(a) to be 1 + a + + (3). We then have 

2 



E F(w(y'),3) " E - 



Bj + g^+i 



U(u-2j) U(u-6)' 



(19) 



The aim is to show, using Lemma[7J that the right-hand side of the above inequality is at least 1. We 
dispose of an easy case first. If cj = m, then note that we must have m = mi = u, and m,2 = 0. 
With this, Lemma [7J yields B + B x = U(u), and B 2 + B 3 = B 4 + B 5 = B d = 0. Hence, the 
right-hand side of (fl9T i simplifies to jj^j = 1. This proves the desired inequality (TBT i when u = m. 

Also, for small values of uj, it can be checked by direct computation using Lemma [7] that the 
right-hand side of (fT9l ) is at least 1. We used a computer to check this for oj < 16 and all valid 
choices of m, mi and 7712- Here, "valid" means that these quantities must be realizable as the 
number of 1-runs of the appropriate type in a binary sequence x' of Hamming weight co. 

Thus, we may henceforth assume that 1 < m < to — 1 and uj > 17. 

We carry out some more simplifications. The idea is to justify ignoring the terms that involve 



Ba+B 5 



mi and m 2 in the formulae stated in Lemma|7] When we expand out ^ 2 J~_^ -r 
using Lemma [71 we obtain an expression that includes the following terms: 

m(m — mi) {uj — 2m — 3)(m 



+ 



U{uj - 2) 
Re-write this as 

m{m — mi) 



1 



U{uj - 4) 



1 



mi) + m 2 (uj — m){m — mi) + 4{uj 



U(u-6) 
2m + mi) + m 2 



U{uj - 6) 



U(uj - 4) U{uj - 2) 



+ [{uj — m){m 
3(m 



mi) + m 2 \ 



1 



1 



+ 



mi 



+ 



U{uj 

4{uj — 2m + mi) 



6) U{uj - 4) 



U{uj - 4) U{uj - 6) 

The above expression is a sum of four terms, each of which is non-negative. (To see that the last 
term is non-negative, observe that uj > mi + 2(m — mi) = 2m — mi; this is because each 1-run 
counted by mi contains exactly one 1, while the remaining m — mi 1-runs contain at least two Is 
each.) Therefore, the sum + , B T t +B A + JTI B $ R , is at least 

' ' U{u— 2) (7(a)— 4) U(uj— 6) 



{uj — m)(l + m 



U{uj - 2) 
which can also be expressed as 



(?)) , (! + "*)[( 



u>—m\ 
2 



)-{uj-m)] ( UJ ~ m ) -{uj-m){uj-m+l) 



U{uj - 4) 



+ 



{uj 



m 



1 + m + 



(2) 



+ 



i(l + m)(w 



m 



3) , i[( 



Next, we write 



^0 + ^1 

U{uj) 



U{uj - 4) 
U{m) 



+ 



U{uj - 6) 
^ - m) 2 — 9(w — m) - 



U{uj - 6) 



(20) 



U{uj) - U{m) 



1 



|(w — m)(w 2 + wm + m 2 + 5) 



(21) 



12 



NAVIN KASHYAP AND GILLES ZEMOR 



Putting d20b and (|2TT) together, we find that the right-hand side of ( fl9l ) is lower bounded by 

1 + (oj - m)g UJ {m), (22) 



where 



1 + m+CT) i(l + m)(w-m-3) 
= ^77 ^ + 



+ 



C/(w - 2) C/(w - 4) 

g [(w - m) 2 - 9(w - m) — 4] g(u; 2 + wm + m 2 + 5) 



f7(w - 6) J7(w) 

For a fixed uj, consider as a function of m. Some tedious computations (some of which were 
performed with the aid of Maple) show the following: 

• for uj > 6, g u is a convex function, i.e., g'^(x) > for 1 < x < uj; 

• forw > 12, g' u (uj - 1) < 0; 

• for uj > 13, g u (uj — 1) > 0. 

From this, we obtain the fact that, as long as uj > 13, we have g w (m) > for 1 < m < uj — 1. 
Thus, for these values of uj and m, (1221 yields that the right-hand side of (fl9T i is lower bounded by 
1. Recalling that we only needed to show this for uj > 17, the proof of the t = 3 case in Theorem [5] 
is complete. 

5. An information-theoretic upper bound on R(t) 

The method of the previous sections would yield a bound on the asymptotic rate R(t), as defined 
in (0, were Conjectures 13. 1 1 or 14. 1 1 to be proved. Instead, in this section, we use an information- 
theoretic approach to derive an upper bound on R(t). In fact, in Section [6l we sketch an argument 
that indicates that our information-theoretic bound is better than any upper bound on R(t) that can 
be obtained from Conjecture 13. II 

For every even n, by grouping together adjacent coordinates, we can view any code C G {0, l} n 
as a code of blocklength n/2 over the alphabet {00, 01, 10, 11}. Let us say that a binary n-tuple, 
alternatively an nj 2-tuple over the quaternary alphabet, has quaternary distribution (or simply dis- 
tribution) (/oo, /n, /oi, /io) if it h as /oo^/2 symbols 00, fun/2 symbols 11, foin/2 symbols 01 
and /ion/2 symbols 10. We will say that a code has constant distribution if each of its code- 
words has the same quaternary distribution (/oo, /n, foi, fw)- Our goal is to find upper bounds 
on the rate of |Yn] -grain-correcting codes of constant distribution: since the number of possible 
quaternary distributions for a code of length n is 0(ra 3 ), the maximum of these upper bounds on 
constrained codes will yield an unconstrained upper bound. 

Let us introduce the following notation: 

R f (r) = lim sup - log 2 M(n, /, [rn] ) 

n— >-oo Tl 

where M(n, /, t) denotes the maximum cardinality of a i-grain error correcting code of length n 
and constant quaternary distribution /. 

Our strategy is the following: for any given distribution / = (/oo, /n, /oi, /io)> we associate 
to it a discrete memoryless channel (DMC) with input and output alphabets {00, 01, 10, 11} such 
that any infinite family of \rn\ -grain-correcting codes of constant distribution / achieves vanish- 
ing error-probability when submitted through this channel. By a standard information-theoretic 
argument, this implies that the asymptotic rate R of any family of [rn] -grain-correcting codes of 
constant distribution / is bounded from above by half the mutual information between the channel 
input with probability distribution / and the channel output. 

Consider the channel depicted in Figure Q] Let C be a member of a family of [rn] -grain- 
correcting codes of length n and constant distribution /. Suppose that 

(/10 + /01W2 < rn(l-e), 
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FIGURE 1 . A DMC whose effect can be mimicked by grain patterns 

where p is the transition probability shown in Figure Q] When a binary n-tuple, equivalently a 
word of length n/2 over the alphabet {00, 01, 10, 11}, is transmitted over the channel, then with 
probability tending to 1 as n goes to infinity, the number of transitions 01 — > 00 plus the number of 
transitions 10 — > 11 is not more than [rn] . Since these transitions are of the kind caused by grain 
errors, if there are no more than [rn] such transitions, then the errors they cause are correctable 
by any [rn] -grain-correcting code. Therefore, for any e > 0, any family of [rn] -grain-correcting 
codes of constant distribution / can be transmitted over the above channel with vanishing error 
probability after decoding. By a continuity argument we conclude that: 

R f ( T )< l -I{X,Y) (23) 

where X is the channel input with probability distribution p{X) = f, and Y is the corresponding 
output of the channel with parameter 

2r 

P= T — r - (24) 
Jio + Joi 

It remains to compute the mutual information I(X, Y). Since p < 1, (l24l ) implies that we can 
write 

/io + /io = 2r + x (25) 
foo + hi = 1 - 2r - x (26) 
with x non-negative. Now, for every distribution satisfying (l25l and (l26l) we have 

H(Y | X) = (2r + x) h ' _> ' 



2r + x 

where h(-) is the binary entropy function defined by h(£) = — f; log 2 £ — (1 — £) log 2 (l — £), for 
£ G [0, 1]. This implies that I(X, Y) = H(Y) - H(Y \ X) is maximum under the constraints (|25T ) 
and (l26l ) when H(Y) is maximum, i.e. under the distribution: 

P(Y = 10) = P(Y = 01) = -, P{Y = 00) = P(Y = 11) = — -. 



Therefore, we obtain 

fflTKU b(x) - (It 4- x) h ( 

2r + x 



I(X,Y) < l + h(x) - (2r + x)h ( — ^— ) , (27) 
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which together with (1231 ) gives 



1 + K/io + hi - 2r) - (/io + /oi) h 



2r 



/lO + /( 



01 



The right hand side of (l27T i is maximized for x = 1/2 — r, thus yielding the unconstrained upper 
bound stated in the theorem below. 

Theorem 8. For r G [0 ,|], we have 



1 



i*(r)<- 1 + h --r 



1 



2t 

r +r 




Figure 2. The upper bound of Theorem [8] along with bounds from ||3]. 

The upper bound of Theorem [8]is plotted in Figure |2] For comparison, also plotted are the upper 
and lower bounds from [3 Figure 1]. The plots clearly show that the upper bound of Theorem [8] 
improves upon the previous upper bounds, but still remains far from the lower bound plotted. It 
should be pointed that a slightly better lower bound was found by Sharov and Roth Q. Unfortu- 
nately, the improvement is only minor: the lower bound of [5 ] remains above 0.5 only in the interval 
[0, 0.0566], and in that interval, the improvement does not exceed 0.012. 

6. Concluding Remarks 

In this paper, we derived two upper bounds, one on the maximum cardinality, M(n, t), of a 
binary t-grain-correcting code of blocklength n, and the other on the asymptotic rate R(t). In both 
cases, the gap between the upper bound and the best known lower bound remains significant. A 
natural question to ask is whether the putative upper bounds on M(n, t) in Conjectures 13.11 and 14. 1 1 
would yield a better bound on R(t). 

The bound in Conjecture 14. II is the stronger of the two conjectured bounds, and its asymptotics 
are straightforward to analyze. Let M(n,t) = 2 YlZ=o ("J 1 ) vtuj) denote the upper bound in 
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Figure 3. The upper bound of Theorem [8] compared with the asymptotic bound 
obtained from Conjecture I4. II 



([Hi and let R(t) = lim„_).oo \ log 2 M(n, nr). Conjecture |4~T1 implies that R(t) < R(t). By 
standard asymptotic analysis, we obtain 

R(t) = max h(z^) — riiv), 
where rj{v) equals v if v < 2r, and equals v\\{t/v) otherwise. Thus, 

R(t) = max< max h(z^) — u, max h(u) — v\\(t/v) 

I 0<u<2t 2t<v<1 

Using elementary calculus to solve the two maximization problems within the braces in the 
above equation, and comparing the solutions (details of these calculations are omitted), we obtain 
the following: 

R (T) J^"J-""^l^ ^<V 6 (28) 
V ' I h(l/3) - 1/3 if r > 1/6 



where v* = l(r + 1 + Vt 2 — 6r + 1). This bound is compared with the bound of Theorem [8] 
in Figure [3] The plot shows that the conjectured upper bound (l28l) improves upon the bound of 
Theorem[8]for r < 0.214. 

Appendix A 
In this appendix, we prove the following lemma. 
Lemma 9. For t = 2, 3, a pairing satisfying < \13i can be constructed. 

We introduce some convenient notation that will be used in the proof. Notation of the form 

E 

ab — > cd or db — > cd 

will be used to denote the fact that a length-2 grain (from the grain pattern E) acting on a converts 
the substring ab of the derivative sequence x' to the substring cd of y'. 

Case I: t = 2 . Let y £ ^(x). Let ui and m be the number of 1-runs and the Hamming weight, 
respectively, of x', and let Co and m denote the same for y'. Since the action of a grain pattern cannot 
increase the weight of x', we have ui < to. Also, note that a single grain can cause the number of 
1-runs in x' to increase by a most 1 — an increase by 1 happens either when *1100* — > *1010*, 
or when *11 11* — y *1001*. Thus, rh < m + 2. 



16 



NAVIN KASHYAP AND GILLES ZEMOR 



We first show that |$ 2 (y)| > |*2( x )| iff w = w and fh > to. The "if" part is clearly true by 
Proposition [Ub). For the "only if" part, suppose that to < m. Then, since Co < co is always true, we 
have | <I>2 (y ) I < |^2( x )| by Proposition (Ub). On the other hand, if Co < co — 1, then since fh < Co 
is always true (the number of 1-runs cannot exceed the number of Is), we have 

|* 8 (y)| - |<& 2 (x)| = (m - m) + Q - Q < « + ( W ~ ^ - (£j = Co - (co - 1) < 0. 

Hence, |$ 2 (y)| < |$2(x)|. 

Thus, -F 2 (x) = {y G 3>2(x) : Co = to and to > to}. If x has weight co = 1, then there is no 
y G <l?2(x) for which to > m, so that ^(x) = 0. We henceforth consider co > 2. We assume 
y G -^(x), and suppose that E G £ n ,2 is a grain pattern such that y = c/\e(x). Since co = co, 
the grains in E do not act upon non-trailing Is in x'. Let fh = to + a, a = 1 or 2, so that 
| <I>2 (y ) I = | < ^ ) 2(x)| + a. We will construct a z G G 2 (x) with which y can be paired. 

Suppose first that a = 1, i.e., m = m + 1. There is a unique segment *1100* of x' such 

E 

that *1100* — > *1010*. Let j be the position of the trailing 1 affected. Consider the grain pattern 
E' G £ n> 2 that acts instead on the preceding 1, i.e., the position j in E is replaced by j — 1 in E'. Let 

E' 

z = 0#/(x), and note that in x' — > z', the same segment *1100* of x' now undergoes the change 

E' 

*1100* — > *0000*. Thus, the number of 1-runs in z does not exceed the number, m, of 1-runs in 
x'; and moreover, w(z') = co - 2. Hence, |$ 2 ( z )l ~ l $ 2(x)| < ( w ~ 2 ) - (£) = -(2w - 3) < -1, 
since we assumed w > 2 at the outset. 

Thus, we have |$ 2 (y)| = 1*2 (x)| + 1 and | «E> 2 (z) | < | < l ) 2(x)| - 1. With this, we have r^^ji + 

^iyy > So ' we can P air y £ F 2(x) with z. 

Now, suppose that a = 2, i.e., fh = m + 2. There are now exactly two segments of x' such that 

E , 

*1100* — > *1010*. Let E = E — 1 be obtained by replacing each j G E by j — 1, and consider 
z = 0#'(x). Once again, the number of 1-runs in z' does not exceed to, but now, we have co(z') = 
to -A. This time, |$ 2 (z)| - |$ 2 (x)| < ( w 2 4 ) - (£) = -(4w - 10). Note that w must be at least 3, 
since fh = to + 2 is not possible when w = 2. Therefore, |$ 2 (z)| — |$ 2 (x)| < — 4(co — 10) < —2. 
Thus, |$ 2 (y)| = |$ 2 (x)| + 2 and |$ 2 (z)| < |$ 2 (x)| - 2. Hence, + > and 

we can pair y with this z. 

By construction, the pairing y h ^ z is a one-to-one map. 

Case II: t = 3 . Consider any y G ^(x). To the notation introduced above, we add mi and mi 
to denote the number of 1-runs of length 1 in x' and y', respectively. As before, Co < co, but this 
time, fh < to + 3 since x' can be affected by up to three grains. Also, a single grain can cause an 
increase of 2 in mi: *011110* — > *010010* or *01100* — > *01010*. Hence, fh\ < mi + 6. 

Suppose first that Co < co — 2. Then, from Proposition [Tic), we have | < l ) 3(y)| < 1 + (toi + 6) + 
(m,+3)(w-2-3) + 2(( w - 2 ) + ( W 2 2 ) +2(w-2). Upon simplifying, we obtain |$ 3 (y)|- |$ 3 (x)| < 
-{co 2 - 9co + 20) - 2to. Since to > 1, we further obtain |$ 3 (y)| - |# 3 (x)| < -(co 2 - 9co + 22), 
which is a negative quantity. This cannot happen for y G -^(x), so Co must equal co — 1 or co. 

Since Co equals co — 1 or co, we may assume that y = 0^(x), for some grain pattern E G £ n> s 
that acts only upon the trailing Is in x'. Let Ei, i = 1, 2, 3 be the subset of E consisting of grains j 
that act on the trailing Is of 1-runs of length i; also let E^ = E \ {E\ U E^ U E$) be the subset of 
E acting on the trailing Is of 1-runs of length at least 4. Set a = \Ei\, i = 1, 2, 3, 4. It is easy to 
see that to < m + e 2 + e 3 + e^, while fh\ < mi + 2e 2 + e 3 + e4. 

Let d = e 2 + e 3 + e4. If d = 0, then fh < to and toi < mj. Since w < co always, we have 
|*3(y)| < | ^3 (x) I by Proposition [TJc), which is not possible for y G -^(x). 

At this point, we have that Co equals co — 1 or co, and d equals 1, 2, or 3. We will now construct a 
z to be paired with y. Let E' = E\ U {j — 1 : j G E<i U E% U ^j. Thus, £" is a grain pattern in 
£ n 3 that retains the grains from E that act on trailing Is from 1-runs of length 1, but pushes back 
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all the other grains in E by one position. Let z = <^/(x). We claim that the desired inequality 



+ 



l$ 3 ( z )l > |$3(x)| n °lcl s - The remainder of this proof justifies this claim. 



It is enough to show that |[|* 3 (y)| + | «E>3 (z) | ] < | ^3 (x) | , since by the concavity of the function 
f(x) = 1/x, we would then have 



1 1 

+ 



1 1 

> > 



r[|*3(y)| + |*3(z)|] " l$3(x)|' 



1*3 (y) I l*s(z) 

To this end, note first that 

|* 3 (y)| - |*s(x)| < 2e 2 + e 3 + e 4 + (w - 3)d. (29) 
Next, we bound |5>3(z)| — | ^3 (x) | . It is not difficult to check that w(z') = u — 2d, the number 
of 1-runs in z' is at most m, and at most mi + e% of these are of length 1. Thus, | ^3 (z) | < 

1 + (mi + e 3 ) + m{u) - 2d - 3) + ( w ~ 2d ) - + 2(w - 2d), and hence, 

|$ 3 (z)|-|$ 3 (x)| <e 3 -2md-£(w,d), (30) 
where £(u, d) = (3) - (£) + 2u - [{ ul ~ 3 2d ) - +2(u- 2d)] . From $29$ and ©, we obtain 

-[|$ 3 (y)| + |$ 3 (z)|]-|$ 3 (x)| < e 2 + e 3 + ^e 4 + ^(u}-3)d-md- -^(00, d) 

< d+ I( w _3) d _ d 2_ ^( w>d ) > (3i) 

where we have used the fact that m> d, which is simply the observation that IE2U.E3U.E4I cannot 
exceed the number of 1-runs in x'. 

If d = 1, the expression in (OTI ) reduces to — \{oo 2 — 7uj + 14), a negative quantity. If d = 2, we 
obtain — (oj 2 — 9oj + 24) instead, which is still a negative quantity. If d = 3, we get — |(o; 2 — 11a; + 
i^p), which is also a negative quantity. Thus, in all cases, we have |[| < l ) 3(y)| + |<I ) 3(z)|] < |<& 3 (x)| 
as desired. □ 
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